Developments on Monte Carlo Go
نویسندگان
چکیده
We have developed two go programs, Olga and Oleg, using a Monte Carlo approach, simpler than Bruegmann’s [Bruegmann, 1993], and based on [Abramson, 1990]. We have set up experiments to assess ideas such as progressive pruning, transpositions, temperature, simulated annealing and depth-two tree search within the Monte Carlo framework. We have shown that progressive pruning alone gives better results than Bruegmann’s Monte Carlo go, which uses transpositions, temperature and simulated annealing. Nevertheless, transpositions and temperature are good speed up enhancements that do not lower the level of the program too much. Moreover, the results of our Monte Carlo programs against knowledgebased programs on 9x9 boards and the ever-increasing power of computers lead us to think that Monte Carlo approaches are worth considering for computer go in the future.
منابع مشابه
Monte-Carlo Go Developments
We describe two Go programs, and , developed by a Monte-Carlo approach that is simpler than Bruegmann’s (1993) approach. Our method is based on Abramson (1990). We performed experiments to assess ideas on (1) progressive pruning, (2) all moves as first heuristic, (3) temperature, (4) simulated annealing, and (5) depth-two tree search within the Monte-Carlo framework. Progressive pruning and the...
متن کاملMonte Carlo Go Has a Way to Go
Monte Carlo Go is a promising method to improve the performance of computer Go programs. This approach determines the next move to play based on many Monte Carlo samples. This paper examines the relative advantages of additional samples and enhancements for Monte Carlo Go. By parallelizing Monte Carlo Go, we could increase sample sizes by two orders of magnitude. Experimental results obtained i...
متن کاملMarkov Chain Monte Carlo Methods for Statistical Inference
These notes provide an introduction to Markov chain Monte Carlo methods and their applications to both Bayesian and frequentist statistical inference. Such methods have revolutionized what can be achieved computationally, especially in the Bayesian paradigm. The account begins by discussing ordinary Monte Carlo methods: these have the same goals as the Markov chain versions but can only rarely ...
متن کاملDevelopments on Product Propagation
Product Propagation (PP) is an algorithm to backup probabilistic evaluations for abstract two-player games. It was shown that PP could solve go problems as efficiently as Proof Number Search (PNS). In this paper, we exhibit a few domains where, for generic non-optimized versions, PP performs better than previously known algorithms for solving games. The compared approaches include alpha-beta se...
متن کاملMonte-Carlo Approximation of Temperature
Monte-Carlo tree search is a powerful paradigm for the game of Go. We propose to use Monte-Carlo tree search to approximate the temperature of a game, using the mean result of the playouts. Experimental results on the sum of five 7x7 Go games show that it improves much on a global search algorithm.
متن کامل